Automatic estimation of formant and voice source parameters using a subspace based algorithm

Authors

  • Chang-Sheng Yang
  • Hideki Kasuya
Abstract

An automatic method is proposed for jointly estimating formant and voice source parameters from a speech signal. A Rosenberg-Klatt model is used to approximate the voicing source waveform for voiced speech, whereas a white noise signal is assumed for unvoiced speech. The vocal tract characteristic is represented by an IIR filter. The formant and anti-formant values are calculated from the IIR filter coefficients, which are estimated using a subspace-based system identification algorithm, while an exhaustive search is applied to obtain the optimal source parameter values under an error criterion defined in the frequency domain. An experiment was performed to examine the performance of the proposed method on natural speech. The results show that the source parameters estimated by the method, such as glottal opening and closure instants, are in good agreement with those defined on the electroglottograph signals, and that the estimated formant values are also accurate.
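
The step from IIR filter coefficients to formant and anti-formant values can be illustrated with the standard root-to-resonance conversion. This is a minimal sketch, not necessarily the authors' exact procedure: the function names `resonances_from_polynomial` and `resonator_coeffs`, the 8 kHz sampling rate, and the test resonances are all assumptions made for illustration. Applying the conversion to the denominator polynomial gives formants; applying it to the numerator gives anti-formants.

```python
import numpy as np

def resonances_from_polynomial(coeffs, fs):
    """Convert a polynomial in z^-1, given as [1, c1, ..., cp], into
    (frequencies_hz, bandwidths_hz) of its complex-conjugate root pairs.
    Hypothetical helper: use denominator coefficients for formants and
    numerator coefficients for anti-formants."""
    roots = np.roots(coeffs)
    # Keep one root from each conjugate pair (positive imaginary part).
    roots = roots[np.imag(roots) > 1e-9]
    freqs = np.angle(roots) * fs / (2.0 * np.pi)   # root angle -> Hz
    bws = -np.log(np.abs(roots)) * fs / np.pi      # root radius -> 3 dB bandwidth
    order = np.argsort(freqs)
    return freqs[order], bws[order]

def resonator_coeffs(freq_hz, bw_hz, fs):
    """Second-order section [1, -2 r cos(theta), r^2] for one resonance.
    Used here only to build a test filter with known resonances."""
    r = np.exp(-np.pi * bw_hz / fs)
    theta = 2.0 * np.pi * freq_hz / fs
    return np.array([1.0, -2.0 * r * np.cos(theta), r * r])

# Assumed example: two resonances at an assumed 8 kHz sampling rate.
fs = 8000.0
denominator = np.convolve(resonator_coeffs(500.0, 80.0, fs),
                          resonator_coeffs(1500.0, 120.0, fs))
formants, bandwidths = resonances_from_polynomial(denominator, fs)
print(formants, bandwidths)
```

In practice the coefficients would come from the identification step rather than from a synthetic filter, and real roots or roots with very wide bandwidths are usually discarded before labelling the remainder as formants.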

Related resources

Aperiodicity control in ARX-based speech analysis-synthesis method

We present an improved algorithm for a robust speech analysis-synthesis method based on an auto-regressive with exogenous input (ARX) speech production model proposed previously. The speech analysis-synthesis method is capable of making an automatic estimation of vocal tract (formant) and voice source parameters from a speech utterance, generating accurate formant values even for very high-pitch...

A new speech synthesis system based on the ARX speech production model

In this paper, we present a new formant-type speech analysis-synthesis system based on the ARX (Auto-Regressive with Exogenous Input) speech production model. The model consists of cascade formant-antiformant synthesizers driven by a voicing source and an unvoiced turbulent noise source. One of the key features of the proposed method is that we have an algorithm to automatically measure the voic...

Semi-Blind Channel Estimation based on subspace modeling for Multi-user Massive MIMO system

Channel estimation is an essential task to fully exploit the advantages of massive MIMO systems. In this paper, we propose a semi-blind downlink channel estimation method for massive MIMO systems. We suggest a new model for the channel matrix subspace. Based on the low-rank property, we propose an algorithm to estimate the channel matrix subspace. In the next step, using o...

Joint Estimation of Voice Source and Vocal Tract Parameters as Applied to the Study of Voice Source Dynamics

A novel method is presented for the joint estimation of voice source and vocal tract (formant/anti-formant) parameters from the acoustic speech signal. The method is based on the ARX (auto-regressive with exogenous input) model with a glottal flow waveform as an input to the system. A nonlinear optimization strategy is employed to estimate glottal flow parameters, whereas an extended Kalman fil...

An improved speech analysis-synthesis algorithm based on the autoregressive with exogenous input speech production model

Ding et al. have explored a novel pitch-synchronous speech analysis-synthesis method[1] based on an auto-regressive with exogenous input (ARX) speech production model. This method makes an automatic estimation of the vocal tract (formant) and voice source parameters from a speech utterance. This method, however, has suffered deficiencies in the analysis of a high-pitch voice and the introductio...

Publication date: 1998